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Abstract 

Consider a random regular graph with degree d and of size n. Assign to each edge 
an i.i.d. exponential random variable with mean one. In this paper we establish a precise 
asymptotic expression for the maximum number of edges on the shortest-weight paths 
between a fixed vertex and all the other vertices, as well as between any pair of vertices. 
Namely, for any fixed d > 3, we show that the longest of these shortest-weight paths has 
about a log n edges where a is the unique solution of the equation a log ffzf a:) — a = ^| , 
fora> 3=1. 



1 Introduction 

The focus of this paper is on first passage percolation on a random regular graph, namely 
on G ~ Q(n,d), a graph uniformly distributed over the set of all graphs on n vertices [n] := 
{1, . . . , n}, in which every vertex has degree d, for d > 3 and n large. We assume that each 
edge in this graph has an i.i.d. exponential weight with mean one. We consider the shortest- 
weight paths between any pair of vertices of this graph, and establish that the longest of these 
shortest- weight paths has about a log re edges for some positive constant a depending on d that 
we will shortly define. We also derive a similar precise asymptotic expression for the maximum 
number of edges on the shortest- weight paths between a fixed vertex and all the other vertices, 
see Theorem [T] for the exact statement. 

Let G = (V, E, w) be a weighted graph, defined as the data of a graph G = (V, E) and a 
collection of weights w = {w e } e< =E associated to each edge e G E. For two vertices a, b G V, 
the weighted distance between a and b is given by 

dist w (a,b) = min N w e , 

■K&n(a,b) 

where the minimum is taken over the set II(a, b) of all paths between a and b in the graph. For 
a, b £ V we denote by 7r(a, b) the shortest-weight path between a and b. 
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We define the function / : K + — > R as follows 

/(a):=alog(^a)-a + ^. (1) 

Note that f'(a) = log (j^otj is positive for a > and / (3=2) = — 1- 

We let a* and 2 be respectively the unique solutions to f(a) = and f(a) = 1 for a > 

The main result of this paper is the following theorem. 

Theorem 1. Fix d > 3 and let G ~ G(n,d) be a weighted random d-regular graph with n 
vertices and i.i.d. rate one exponential variables on its edges. Then, as n — )■ 00, we have 



max i6[n] |vr(l, j)\ p 



logra 

and 



(2) 



a, (3) 



logn 

where — > denotes the convergence in probability. 

In order to compare our result with the existing ones, we reproduce here a result of Bhamidi, 
van der Hofstad and Hooghiemstra [8] concerning the number of edges in the shortest-weight 
path between two uniformly chosen nodes (as well as the weighted distance); see also [23] for 
the joint distribution of (weighted) distances in random regular graphs. Remark that, the 
following theorem is stated in [8] in a more general setting (random graphs with i.i.d. degrees). 

Theorem 2 (Bhamidi, van der Hofstad and Hooghiemstra [8]). Fix d>3 and let G ~ Q(n,d) 
be a random d-regular graph with n vertices and i.i.d. rate one exponential variables on its 
edges. Then, as n — > 00, 

|tt(1,2)| -7logn d) z , 4 » 



\Ay log n 

where Z has a standard normal distribution, 7 = jr^, and denotes the convergence in 
distribution. Furthermore, there exists a non- degenerate random variable W such that 

distal, 2)-—^ log n A W. (5) 



By the above theorem, the ratio of the length and the weight along a shortest-weight path 
between two (uniformly chosen) nodes is asymptotically d — 1 while this ratio for a minimum 
length path between two nodes is asymptotically 1. Our proof of Theorem [1] (see Section 2]) 
implies that, there exists with high probability (that is, with probability tending to 1 as n — > 00) 
shortest- weight paths of length about a log n whose total weight is about 1°§ n (typical 
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weighted distance between two uniformly chosen nodes). This means that, for these paths, the 
ratio of the length and the weight is even larger, i.e., asymptotically (d — 2)3! 



For completeness, we also include results of Ding, Kim, Lubetzky and Peres [13] concerning 
the weighted diameter in random regular graphs; see also [1] for a generalization. 

Theorem 3 (Ding, Kim, Lubetzky and Peres [13]). Fix d > 3 and let G ~ G(n, d) be a random 
d-regular graph with n vertices and i.i.d. rate one exponential variables on its edges. Then, as 
n — > oo, we have 

max jg[n ] dist w (l,j) p 1 1 

logn ~^ d-2 + d' U 

and 

max iieW dist w (i, j) p 1 2 . . 

— — ^ h -• (7) 

log n d — 2 d 

In particular, the result of [13] implies that there exists with high probability shortest- 
weight paths of length about log n (the same as the length between two uniformly chosen 

nodes, see Theorem [2|) whose total weight is about + |) logn. This result is used in [3] 
to analyze an asynchronous randomized broadcast algorithm for random regular graphs. 



Related work. First passage percolation model has been mainly studied on lattices moti- 
vated by its subadditive property and its link to a number of other stochastic processes, see e.g., 
\\.7\ [20] [T8] for a more detailed discussion. First passage percolation with exponential weights 
has received substantial attention, in particular on the complete graph [16] HU [2] [1] [12] [22] . 
and more recently on random graphs [7] [8] [9] [131 IH E] • Iri particular, Janson [19] considered the 
case of the complete graph with fairly general i.i.d. weights on edges, including the exponential 
distribution with parameter one. It is shown that, when n goes to infinity, the asymptotic 
distance for two given points is log n/n, that the maximum distance if one point is fixed and 
the other varies is 2 log n/n, and the maximum distance over all pairs of points is 3 log n/n. He 
also derives asymptotic results for the corresponding number of hops or hopcount (the number 
of edges on the paths with the smallest weight). It is shown that (when n goes to infinity) the 
number of hops is logn for two given nodes, and the maximum hops if one point is fixed and 
the other varies is elogn. More recently, Addario-Berry, Broutin and Lugosi [1] showed that 
the longest of these shortest-weight paths in a complete graph has about 5 logn edges where 
5 ~ 3.5911 is the unique solution of the equation alog(a) — a = 1, which answered a question 
posed by Janson [19]. Note that a* — > e and a — > a as d — > 00. 

Organization of the paper. The remainder of the paper is organized as follows. In the next 
section we provide several preliminary facts on random regular graphs. We also consider in this 
section the exploration process for configuration model which consists in growing balls (neigh- 
borhoods) simultaneously from each vertex. In addition, the section provides some necessary 
notations and definitions that will be used throughout the paper. Sections [3] and H] form the 
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heart of the proof. We first prove that the above bound is an upper bound in Sections El The 
final section provides the corresponding lower bound using the second moment method, applied 
to a suitably defined set of shortest paths with special properties that make them amenable to 
analysis. 

Basic notations. Let {X n } n ^fq be a sequence of real-valued random variables on a sequence 
of probability spaces Pn)} n eN- If c G M is a constant, we write X n A c to denote that X n 
converges in probability to c. That is, for any e > 0, we have P n (|X n — c| > e) — > as n — > oo. 
Let {a n } ng N be a sequence of real numbers that tends to infinity as n — > oo. We write X n = 
o p (a n ) if \X n \/a n converges to in probability. Additionally, we write X n = O p (a n ) to denote 
that for any positive- valued function u(n) — > oo, as n — > oo, we have P(|X n |/a n > w(n)) = o(l). 
If £ n is a measurable subset of Q n , for any n £ N, we say that the sequence {£ n }neN occurs 
with high probability (w.h.p.) if P(£ n ) = 1 — °(1)> as n — )■ oo. 

The notation B'\n(k,p) denotes a binomially distributed random variable corresponding to the 
number of successes of a sequence of k independent Bernoulli trials each having probability of 
success equal to p. 

We recall here that for two real- valued random variables A and B, we say A is stochastically 
dominated by B and write A < s t B if for all x, we have P(^4 > x) < P(B > x). If C is another 
random variable, we write A < s t {B \ C) if for all x, F(A > x) < ¥(B > x \ C) almost surely. 

2 Preliminaries 

2.1 Configuration model 

We recall first the setup of the configuration model (CM), as introduced by Bender and 
Canfieldp] and Bollobas [ID] . To construct a graph using this method, to each of the n (even) 
vertices allocate al distinct half-edges, and select a uniform perfect matching on these points. 
When a half-edge of i is paired with a half-edge of j, we interpret this as an edge between i 
and j. 

The random graph obtained following this procedure may not be simple, i.e., may contain 
self- loops due to the pairing of two half-edges of i, and multi-edges due to the existence of more 
than one pairing between two given nodes. Conditional on the event that the graph produced is 
simple, it is uniformly distributed over the set of all d-regular graphs on n vertices. The prob- 
ability of this event is uniformly bounded away from zero, equivalent to (1 + o(l)) exp ( J 



as n tends to infinity [24]. Hence, any event that holds w.h.p. for the graph obtained via the 
configuration model also holds w.h.p. for G ~ Q(n,d). 

Note that the assumption d > 3 implies that G ~ Q(n,d) is connected with high probabil- 
ity \10\ 124] . We will assume this in what follows. 

The advantage of using the configuration model is that it allows one to construct the graph 
gradually, exposing the edges of the perfect matching one at a time. This way, each additional 
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edge is uniformly distributed among all possible edges on the remaining (unmatched) half-edges. 
2.2 Neighborhoods and tree excess 

For a,b G V, let dist(a, b) = distc(a, b) denote the typical distance between a and b. For a 
vertex a G V and an integer number m, the m-step neighborhood of a, denoted by B(a,m) 
and its boundary dB(a, m), are defined as 

B(a, m) := {v G V | dist(a, v) < m}, and, dB(a, m) := B(a, m)\B(a, m — 1). (8) 

For a vertex a € V and a real number t > 0, the t-radius neighborhood of a in the weighted 
graph, or the ball of radius t centered at a, is defined as 

B w (a, t) : = { b, dist u ,(a, b) < t }. 

The first time t where the ball B w (a, t) reaches size k + 1 > 1 will be denoted by T^(a), i.e., 
T k (a) = mm{t : \B w (a,t)\ >k+l}, T (a) = 0. 

Note that there is a vertex in B w (a, T^(a)) which is not in any ball of smaller radius around 
a. When the weights are i.i.d. according to a random variable with continuous density, this 
vertex is in addition unique with probability one. We will assume this in what follows. Let 
Vk(a) denote this node. Furthermore, let H^a) denote the number of edges (hopcounts) in the 
shortest path between the node a and Vk(a), i.e., the generation of Vk(a). 

For a connected graph F, the tree excess of F is denoted by tx(F), which is the maximum 
number of edges that can be deleted from F while still keeping it connected. By an abuse of 
notation, for a subset W C V, we denote by tx{W) the tree excess of the induced subgraph 
G[W] of G on W. (If G[W] is not connected, then tx(W) := oo.) 

We need the following lemma which demonstrates the well known locally treedike properties 
of G ~ G(n, d) for d > 3. 

Lemma 4. Let G ~ G(n,d) for some fixed d > 3, and let m = [^log d _in\ . Then w.h.p., 
tx(B(u,m)) < 1 for all u G V(G). 

Proof See [HJ Lemma 2.1] . □ 

Consider now the growing balls B w (a, Tk{a)) for < k < n — 1 centered at a and let Xk{a) 
be the tree excess of B w (a,T/ ( .(a)), i.e., 

X k (a) :=tx{B w {a,T k (a))). 

The number of edges crossing the boundary of the ball B w (a,T k (a)) is denoted by S k (a). 
A simple calculation shows that (for G ~ G(n,d)) 

S k (a) = d+(d-2)k-2X k (a). (9) 
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2.3 Shortest-weight paths on a tree 

Assume we have positive integers d\ , d,2, ■■ ■ ■ We consider the following construction of a branch- 
ing process (with these degrees) in discrete time: 

• At time 0, start with one alive vertex (the root); 

• At each time step k, pick one of the alive vertices at random, this vertex dies giving birth 
to dk children. 

This type of random tree is known as (random) increasing trees which have been well- 
studied, see e.g. [T5] . We will need the following basic result, the proof of which is easy 
and can be found for example in [5J Proposition 4.2]. Let := d\ + ... + — (k — 1). 

Lemma 5. Pick an alive vertex at time k > 1 uniformly at random among all vertices alive at 
this time. Then, the generation of the k-th chosen vertex is equal in distribution to 

i=i 

where are independent Bernoulli random variables with parameter 

m = i) = -. 

Si 

In what follows, instead of taking a graph at random and then analyzing the balls, we use 
a standard coupling argument in random graph theory which allows to build the balls and the 
graph at the same time. Fix two vertices, say u and v. We grow the balls around these vertices 
simultaneously at rate 1, so that at time t, B w (u,t) and B w (v,t) are the constructed balls 
from u and v. When these two balls intersect via the formation of an edge (it*,?;*) between 
two vertices u* G B w (u, .) and v* G B w (v, .), then the shortest-weight path between the two 
vertices has been found. Furthermore, we have 

\ir(u,v)\ = \tt(u,u* v )\ + \tt(v,vI)\ + 1. 

2.4 The exploration process 

Fix a vertex a, and consider the following continuous-time exploration process. At time t = 0, 
we have a neighborhood consisting only of a, and for t > 0, the neighborhood is precisely 
B w (a,t). We now give an equivalent description of this process. 

• Start with B = {a}, where a has d half-edges. For each half edge, decide (at random 
depending on the previous choices) if the half-edge is matched to a half-edge adjacent 
to a or not. Reveal the matchings consisting of those half-edges adjacent to a which are 
connected amongst themselves (creating self-loops at a) and assign weights independently 
at random to these edges. The remaining unmatched half-edges adjacent to a are stored 
in a list L. (See the next step including a more precise description of this first step.) 
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• Repeat the following exploration step as long as the list L is not empty. 

Given there are I > 1 half-edges in the current list, say L = (h\, . . . , he), let \& ~ Exp(^) be 
an exponential variable with mean £ _1 . After time \t select a half-edge from L uniformly 
at random, say hi. Remove hi from L and match it to a uniformly chosen half-edge in the 
entire graph excluding L, say h. Add the new vertex (connected to h) to B and reveal 
the matchings (and weights) of any of its half-edges whose matched half-edge is also in 
B. More precisely, let 2x be the number of already matched half-edges in B (including 
the matched half-edges hi and h). There is a total of dn — 2x unmatched half-edges. 
Consider one of the d — 1 half-edges of the new vertex (excluding h which is connected to 
hi ) ; with probability {i — 1) /{dn — 2x — 1) it is matched with a half-edge in L and with the 
complementary probability it is matched with an unmatched half-edge outside L. In the 
first case, match it to a uniformly chosen half-edge of L and remove the corresponding 
half-edge from L. In the second case, add it to L. We proceed in the similar manner for 
all the d — 1 half-edges of the new vertex. 

To verify the validity of the above process, let Bt(a) and L(a,t) be respectively the set 
of vertices and the list generated by the above procedure at time t, where a is the initial 
vertex. Considering the usual configuration model and using the memoryless property of the 
exponential distribution, we have B w (a,t) = Bt(a) for all t. To see this, we can continuously 
grow the weights of the half-edges h\, . . . , hi in L until one of their rate 1 exponential clocks fire. 
Since the minimum of t i.i.d exponential variables with rate 1 is exponential with rate I, this 
is the same as choosing uniformly a half-edge h% after time \& (recall that by our conditioning, 
these t half-edges do not pair within themselves). Note that the final weight of an edge is 
accumulated between the time of arrival of its first half-edge and the time of its pairing (except 
edges going back into B whose weights are revealed immediately) . Then the equivalence follows 
from the memoryless property of the exponential distribution. 

Note that Tj(a) is the time of the i-th exploration step in the above continuous-time ex- 
ploration process. Assuming L(a,Ti(a)) is not empty, at time Tj + i(a), we match a uniformly 
chosen half-edge from the set L(a,Ti(a)) to a uniformly chosen half-edge among all other half- 
edges, excluding those in L(a,Ti(a)). Let Ft be the c-field generated by the above process 
until time t. Given J 7 ^.^), Tj+i(a) — Tj(a) is an exponential random variable with rate Si(a) 
given by Equation ([9]) which is equal to \L(a, Tj(a))| the size of the list consisting of unmatched 
half-edges in B T .r a \(a). In other words, 

(T i+1 (a) - Ti(a) | F Ti {a)) = Exp(5i(a)), 

this is true since the minimum of k i.i.d. rate one exponential random variables is an exponential 
of rate k. 

We will need the following coupling lemma the proof of which can be found in [HI Proposition 
4.5]. 

Lemma 6 (Coupling shortest- weight graphs on a tree and CM). For a uniformly chosen vertex 
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u, we have (for all k > 1) 

k 

d 



H k (u) = y^ji, 



i=l 



where {ij}?^ are independent Bernoulli random variables with parameter 



Si(u) 

and Si(u) is given by Equation^ 

3 Proof of the upper bound 

In this section we present the proof of the upper bound for Theorem [TJ 

As described above, we grow the balls around each vertex simultaneously (at rate one) so 
that at time t, Bt(a) = B w (a, t) is the ball constructed from vertex a. 

We let q := \_2yJ dn log n\ . The following lemma says that for all vertices u and v, the 
growing balls centered at u and v intersect w.h.p. provided that they contain each at least q 
nodes. More precisely, 

Lemma 7. We have with high probability 

B w (u, T q (u)) n B w (v, T q (v)) y£ 0, for all u and v. (10) 

For the sake of readability, we postpone the proof of the lemma to the end of this section. 
Fix two vertices u and v. Let 

C(u,v) : = min{fc > : B w (u,T k (u)) n B w (v,T k (v)) ± 0}, 

be the first time that B w (u,T(u)) and B w (v,T(v)) share a vertex. Thus, by the above lemma 
w.h.p. C(u, v) < q for all u and v. Let us denote by Q the following event: 

Q := {C(u, v) < q for all u and v}. 

Consider now the exploration process started at a vertex u. We will need to find lower 
bounds for S k (u) in the range 1 < k < q. We let r := [(logn) 3 J. 

By the uniform choice of the matching, for every k > 0, the number of half-edges introduced 
by the new vertex at time T k +i(u) and connecting back to B w (u,T k {u)) (given Fx k ( u )) is 
stochastically dominated by a binomial variable 

B\n(d — 1, a), where a = ; ; < . 

dn — 2k n 

where the above inequality is valid for k < ^ — 5. Therefore, the tree excess of B w (u,T k (u)) is 
stochastically dominated by a binomial variable B\n(dk, ^jjp)- 
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We have (for large n) 

P(X r {u) > 2) < P ( Bin (dr, > 2] < ofy = o(n~ 3 / 2 ). (11) 



n 



Moreover, for any k satisfying r < k < 2q, we have by Chernoff's inequality 

H{X k (u) > k/^}) < P (^Bin (dk, > k/V^j < exp f~k/y/r\ < n~ 5 , (12) 

for any sufficiently large n, since k 2 jn = o(k/y/r). 
We conclude by a union bound over all r < k < 2q, 

F({X k (u) < k/y^r, for all r < k < 2q}) > 1 - o(n" 4 ). 

Define the event 

R u := {X r (u) < 1, and X k (u) < k/y/r, for all r <k< 2q}, (13) 
such that ¥(R U ) > 1 — o(n~ 3 / 2 ) by above inequalities. 
Thus defining R : = n«e[n] we ^ ^ union bound 

F(R) > 1 - o^ 1 ' 2 ). 

Consider now two uniformly chosen vertices u and v. We have 

I Q) < st {H q (u) + H q {v) I Q). 

Furthermore, we have 

1 

(H q (u) I R,Q) < st n:=^Ii, 



i=l 



where {Ii}i^Li are independent Bernoulli random variables with parameter 

d- 1 



for all 1 < i < r, and 

for all r < i < g. 
We conclude 



= 1) 

U = 1) 



1 + (d- 2)V 
d-1 



1 + (d- 2)i - lijy/r' 1 



(\ir(u, v)\\R, Q) < st Hi + H 2 , (14) 
where T-L± and %2 are two independent copies of T~L defined above. 
We have the following lemma. 



Lemma 8. We have for some constant C (depending only on d) 

P(7*l+% > a*(logn + loglogra)) < C(nlogn) -1 , 

and, 

¥{Tii +H 2 > 3(logn + log log n)) < C(n log n)~ 2 . 

We postpone the proof of this lemma to the end of this section. 
We conclude by ()14|) . Lemma [8] and union bound that 

P (max|7r(l,j)| > a*(logn + loglogn) ) < F(R C ) + P(Q C ) + CI log n, 
VjgN ) 

and, 

P ( max \Tr(i,j)\ > S(logn + loglogn) ) < P(R C ) + P(Q C ) + CI log 2 n. 
\i,je[n] J 

Since R and Q hold with high probability, we get (w.h.p.) 

max|7r(l,j)| < a*(logn + loglogn), 

je[n] 

and, 

max |7r(i,j)| < S?(logn + loglogn). 
i,je[n] 

This completes the proof of the upper bound for Theorem [TJ 

We end this section by presenting the proof of Lemma [7] and Lemma [8l 

Proof of Lemma^ Fix two vertices u and v. First consider the exploration process for B w (u, t) 
until reaching t = T q (u). We know that w.h.p. the event R holds. Conditioned on R we have 

S q (u) > (d-l-o(l))q. 

Next, consider the exploration process started at v. Each matching adds a uniform half- 
edge to the neighborhood of v. Therefore, the probability that B w {v,T q {v)) does not intersect 
B w (u,T q (u)) is at most 

r _ (d-i-^(i))q ^ q ^ exp (4(d _ l _ o(i)) log n) < 

for any large n. A union bound over u and v completes the proof. 

□ 
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Proof of Lemma We have for A > 0, 



Ee ,K,.n( 1+(e ,_ 1)T ^_) n ( 1+( ^- 1)T 

i=l i=r+l 

Then using the fact that log(l + x) < x, we obtain 



d-l 



+ (d-2)i-2i/y/r y 



+ g log ( 1 + (e ,- 1) i + (d _y_ 2i/yF ) 



i=r+l 

/ r 



d-l ^ d-l 



" I i + ( d -2)i + ^ 1 + (d - 2)i - 2z/^ 



< (e ; 



/ d-l Al d-l 1 A A 

Md-2^7 + d- 21- 2r-V2 2^ J I 
V i=l i=r+l / 



^ (e A -l)^(l + 0(r" 1/2 ))(lo g(? + 2) 
^ (e A -l)^(logg + 3). 



Recall that g = |_2v 'dn log nj . Choosing A := log ( ^zf a* ) , we get 



log Ee A(Wl+W2) < ^a* - ^ — ^ ( lo g n + lo g lo S n + log d + 10). 

By Markov's inequality we have 
F(Hi + H 2 > a*(logn + loglogn)) < Ee A(Wl+W2) exp(-Aa*(log?i + log log n)) 

< exp ^(a* — - — -)(logn + log log n + logd + 10) 

exp ( — Q*log(- a*) (log n + log log n] 



Cexp((a*-^-I-a*log (^T*))) 

exp (log n + log log n) 
C exp (— log n — log log n) 
C(n log n)" 1 . 
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Similarly, by taking A := log ( a) we get 



logEe A(Wl+W2) < (u-^ — ^ (logn + loglogn + log(i + 10), 

and by Markov's inequality we have 

PCHi + U 2 > S(log n + log log n)) < Ee A(Wl+%) exp(- Aa(log n + log log n)) 

< exp ( (a — — — -)(logn + loglogn + logd + 10) 

exp ^— Slog(- 3?) (log n + loglogn^ 

= ^xp((S-^-Slog(^S 

exp (log n + log log n)) 
= C exp (— 2(logn + loglogn)) 
= C(n log n)~ 2 , 

as required. 



□ 



4 Proof of the lower bound 

In this section we present the proof of the lower bound for Theorem [TJ 
For e > (small enough) we define the function f e : IR + — > K as follows 

fe(a) := ^( {d _%*_ e f )-^-<) + 1 L ^ (15) 
= /(a) + a(e-log(l-e)). (16) 

Let a* and 2 e be respectively the unique solutions to f e (a) = and f e (a) = 1 for a > 
Note that a* < a* , a e < a, and furthermore, a* —> a* and a t — > a as e — > 0. 

To prove the lower bound, it suffices to show that for all e > 0, there exist w.h.p. a vertex 
a such that 

K(l,a)| > a*logn, 
and there exists w.h.p. two vertices u and v such that 

|7r(u, u)| > a e log n. 
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For a path 7^ = vq, e±, v%, . . . , ei,vi where and are endpoints of ei for all 16 [/], let 

I 

i=l 

We first show that given that a path v) between u and i> has small weight, it is very likely 
to be the shortest-weight path between its endpoints. More precisely, we have the following. 

Lemma 9. For all n sufficiently large, and any path 7^ = Vo,ei,vx, . . . ,eki v k with k = 
O(logra), we have (for all e > 0) 

P(7fc / ir(v ,v k ) I 10(7*.) < j^ l ogn) = o(l). 

For the sake of readability, we postpone the proof of the lemma to the end of this section. 
Consider a path 7; = Vq, e±,Vi, . . . , ej, vi. It is easily seen that for t > 0, letting Po(t) denote a 
Poisson mean t random variable, we have 

t e 

P(to(7«) < t) =P(Po(t) >£)> exp(-i)- = exp(-t + £\ogt-log£l) . (17) 

In the following, we let £ = £ e be large enough such that (by Stirling formula) 

log^! <£]og£-£(l-e). 

Thus we have for a > 0, 



{ £ fl-e , }(d-2)a\ 

= exp V"«V^ + og V^^J" a( ' 

= exp(-^ f-^ + / e (a) + cdog(d-l)JJ . (18) 
We get for a = a* in $18]) (since / e (a*) = 0) 

K^^^-^M^)' (19) 

and for a = a e in (fT8j) (since / € (a e ) = 1) 
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Lemma 10. Assume t n = [clog?iJ for some positive constant c. For any function u(n) tending 
to oo with n, w.h.p., there exists v £ dB(l,t n ) such that 

where 71(f) denote the path from 1 to v in B(l,t n ). 

Proof. We first prove the lemma for the case c < 51og ^_ 1 ^ . 

Consider now B(l, [clognj) for c < 51og ^ rf _ 1 ^ . By Lemma H] w.h.p. tx(J3(l, [clognj) < 1, 
and then by removing at most one of the children of 1 (and its descendants) we have the tree 
structure and then, \dB(l, |_clograJ)| > (d — l)L clo § n J. i n the following we assume that one of 
the children of node 1 is removed (even if tx(B(l, [clognj) = 0) such that \dB(l, [clog raj )| = 
(d- l)L cl °g"J. 

Let to = l°gd-i logw(n). Note that for any path j to of length to, by Markov inequality 



6wn 



Thus, by union bound, the probability that this would be true for one of the nodes at level to 
of node 1 (i.e., in dB(l,to)) is smaller than 



(d-1 



)to t = logo;(n)log d _ 1 loga;(n) 
eoj(n) eoj(n) 



which goes to zero as n goes to 00. Then w.h.p. the path from the root (1) to all nodes at level 
to has weight smaller that eoj{n). 

We assume i = £ t is large enough such that exp ^ {d—2)a* ) ^ ^" ^ ow consider the following 
branching process starting from a node r at level to, i.e., r £ dB(l,to). 

We call a vertex v good if either v is the root (y = r), or if v lies t levels below a good vertex 
u and 10(71 (it, v)) < /Jl 2 )1* ' wnere 71(^1 v ) denote the path from u to v (in B(l, [clognj)). 

The collection of good nodes form a Galton- Watson tree. Let Z denote the progeny distribution 
of this process. Without need to calculate its distribution, from (|19|) we know that 

EZ = (i - 1)'P (w( 7( ) < ) > exp (^1-;) > 1. 

Hence, with some positive probability q e this process survives. We conclude with probability at 
least q t we have a good node at level [c log raj from the root r at level to ■ Considering the same 
process for all nodes at level t , we conclude that there exists a good vertex at level [clog raj, 
with probability at least (by independence of these processes) 

1 - (1 - q^-^ = 1 - (1 - q e ) lo ^W -». 1, 
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as n — > oo. Then w.h.p. we have a node v at level t n , such that 

This completes the proof of lemma for the case c < L 5 iog(cf 1) -I • 

Now consider the case c > 51og ^_ 1 ^ , and let K be an integer such that c' := c/K < 51og ( rf _ 1 ) ■ 
By previous argument, we know that w.h.p. there exists a node v\ at level [c'lognj such that 
10(71 (i>i)) < ^ c |^2) a* ^ ew ( n )- We know repeat the same argument to find a node vi at 
level [c'logjij below of node v\ such that 10(71 (^i, ^2)) < — ^2) a* ^ + eoJ ( n )i where ji(u,v) 
denote the path from u to v on B(l,t). Note that the tree excess is again at most one, and the 
number of nodes at level to of node v± is at least (d — 2){d — l)* " 1 which goes to infinity as 
n — > 00, and we have the similar arguments. Now repeating this process K— \ times completes 
the proof. □ 

Thus, by above lemma, there exists w.h.p. a node a at level a*logn such that 10(71 (a)) < 
2p§ log n - By Lemma [U this path is optimal. We conclude w.h.p. there exists a node a such 
that 7r(l,a) > a*logn. 

We now prove that there exists w.h.p. two vertices u and v such that 

\tt(u, v)\ > a e log n. 



Indeed, we prove that there exists w.h.p. a path 7 of length a e log n such that 10(7) < ^4 log n. 
Then again using Lemma [9l we conclude the proof. 

Consider the following exploration process starting from a node a. We call a vertex v, a-good 
if either v is the root (v = a), or if v lies I levels below a good vertex u and w{"f a {u., v)) < ^ 2 )s 1 
where "y a (u,v) denote the path from u to v in B(a, .). 

To find the nodes which are a-good, we first explore the nodes in B(a,£), and we find the set 
of nodes at this level which are a-good. Then, for each of these (a-good) nodes, we explore 
again I level behind and we continue the exploration until finding all of the a-good nodes. Let 
us denote by G^(a) the explored graph (starting from a) to find the set of all a-good nodes. 

The following lemma bounds from above the size of Gi(a). 

Lemma 11. Let G ~ G(n,d) for some fixed d > 3. Then there exists a constant A such that 
w.h.p., \Gi(u)\ < ^llogn for all u G V(G). 

The proof of this lemma is given at the end of this section. Hence, we can assume Gg(u) < 
A log ra for all u € V(G) in the rest of the proof. 

We now call a vertex u nice if Gg{u) is a tree and the height of Gi(u), denoted by Dg{u), 
is at least S e logn, i.e., De(u) > S e logn. 
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Note that when u is nice, then there exists a node v at level a e log n behind u such that 
w(j u (v)) < ^5§logn, where j u (v) denote the path from u to v in B(u,.). Using the second 
moment method, we now prove that there exists at least one nice vertex. 

Let iV a denote the event that node a is nice, and X = X^ a e[n] l(-^a) be the total number of 
nice vertices. We now show that X > 1 w.h.p., which concludes the proof. 

Let Z be the distribution of the number of a-good nodes at level I in (d — l)-array tree having 
a as a root. Conditioning on the tree structure of Gg{a) and by removing one of the children 
of a (and all its descendants), the set of a-good nodes are distributed as a branching process 
with distribution Z. Note that by (|20|) . we have 

EZ-li- 1)«P (-(„) < > exp ((-1 + ^) A) . (21) 

Let Pk(a) be the probability that this branching process survives for at least k generations. By 
basic recurrent argument, we have 

P fc+ i(a) = l-$z(l-P fc (a)), 

where = Ks z denote the generation function of Z. 

Note that EZ < 1 (for e small enough) and the branching process is subcritical. Hence, 
P k (a) -> as k -> oo. Using 1 - $ z (l - x) = & z (l)x + 0(x 2 ), and $' z (l) = EZ, it follows 
easily that 

P fc (a) = (EZ + o(l)) fc , as fc -»• oo. (22) 

Thus, conditioning on the tree-structure of Ge(a) (and by choosing k = logn), we get 

P(D e (a) > 3 e logn) > (1 ±o(l))n~ 1+ ^. 

Since the size of Gi(a) is (w.h.p.) smaller that A logn (by Lemma [TT1) . with probability at least 
1 — 0(logn/n), Gg(a) is a tree. 



Putting all these together, we have 



And, 



EX = ^P(iV a ) > \n^. 



EX 2 = E(^l(Ay) 2 = E^l(iV a )l(iV fe ) 



a.b 



E 



E 1 ^) E w + E Wa)im 

b: G e (a)nG e (b)^tD a,b: G e (a)nG i{b)=% 

< (Alogn) 2 EX + (EX) 2 , 
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where the last inequality follows by Lemma [TT1 We conclude that 

Var[X] = EX 2 - (EX) 2 < (Alogn) 2 KX. 

Then, by Chebysev's inequality w.h.p. X > ^n d ~ 2 . 
This completes the proof of the lower bound. 

We end this section by presenting the proof of Lemma [9] and Lemma [TT1 

Proof of Lemma\Q We condition on the path 7^. between vq and v^. We now remove the path 
7/c and consider the exploration process defined is Section [2.41 starting from vq. (The proof is 
similar to \13\ Lemma 3.5].) 

Let Ti denote the time of the z'th exploration step (for i > 0, tq = 0). Note that Ti+i — 73 > s t Yi, 
where Yi are independent exponential random variables with 

E[Yi\ = (l + (d-2)(i + l))" 1 . 

Note that this is true since the worst case is when Xi(a) = 0, i.e., the explored set forms a tree. 

We let z = l\/n/ log nj . We will show later that the growing balls in the exploration process 
starting from vq and Vk will not intersect w.h.p. provided that they are of size less than z. We 

1-e 
2(d-2) 

We have 



now prove that r 2 > 9 hj 9 \ log n with high probability. 



r z <t) < f f[[l + (d-2)i]e-ZLi(i+(d-2)i)xid Xl ...dx z 

= f f[[l + (d-2)i}e-^e- ( - d - 2 ^>=i yi dy 1 ...dy z 



I0<yi<-<y z <t i=1 

^-1 



where y^ = Yli=o x z~i- Letting y = y z and accounting for all permutations over yi, ■ ■ ■ ,y z -i 
(by giving to these variables the range [0,y]), we obtain 

{z - } ~ Jo (z-l)\ 

■ [I (d- 2) z e-^ y*d Vl . . . dy z - x ) dy 



< J^- {d - 1)y U %%T 2] ■ ( II fid-2)e-^d yi ) dy 



< C(d-2)z^ j\-( d -^ y (l-e-( d - 2 ^y 1 dy, 



where C > is an absolute constant. Now using the fact that (l - e^* -2 )") < e -«", for 

1-6 

2(d-2) 



some a > and for all < y < h\\ log n =: to, we obtain 
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Similarly considering the exploration process for Vk, again after time to, we obtain w.h.p. a 
set of size at most z. Now remark that, because each matching is uniform among the remaining 
half-edges, the probability of hitting the ball of size to around vq is at most z/n. Altogether, 

1 - e z 2 
F(7fc / 7r(t> ,ffc) I w (.7k) < jZT2 l ° gn ^ ~ + = 

as desired. □ 



Proof of Lemma Let Zg(a) denote the number of a-good nodes in Bi(a) (i.e., the nodes 

l(l-«0 
(d-2)S e 



in generation i behind a with (weighted) distance smaller than f rom °0- By Markov 



inequality and from (|20p . we obtain 

W(Zt(a) > 1) < EZ e (a) 



< did-lY'^d-iy'expl (-1 + 



d- 2 J a 



! ex P f(-l + T ^U]=:& 



d — 1 \\ d — 2 J a t 

(this follows from the fact that the worst case is when Bi(a) forms a tree). 
Thus, for t = £ e large enough, we have P(Z^(a) > 1) < /3 e < 1. 
We conclude (for any integer K) 

P (g*(o) < Kd(d - l)' -1 ) < /3f . 

Now by choosing K = 2 log n/| log /3 e | , we get 

Pfe(a) < 2d(d- l) i-1 Iogn/|log/9 e |) < n~ 2 . 

Taking a union bound over all a finishes the proof. 



□ 
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